DOCUMENT RESUME 



BD 225 058 

AUTHOR 
TITLl\ 

SPONS AGENCY 
PUB DATE^ 
GRANT 
NOTE 



PUB TYPE 



EDRS PRICE 

'descriptors 



CG 016 406 

Roitman, David B.; Mayer, Jeffrey P. 

Fidelity and Reinvention in the Impleraentatxon of 

Innovations . 

National Science Foundation, Washington, D.C. 
Aug 82 

NSF-iSI-7920576-01 

21p.: Paper presented at the Annual Convention of the 
American Psychological Association (90th, Washington, 
.DC, August 23-27, 1982). 
Reports - Evaluative/Feasibility (142) — 
sXeches/Conference Papers (150) 

MFOl/PCUl Plus Postage. 

Adoption (Ideas); Elementary Secondary Education; 
Evaluation Methods; *Innovation; *Organizational 
Change; *Program Development; *Progiam Effectiveness; 
♦Program Evaluation; Social Science Research 



ABSTRACT i 

The field of social innovation policy can presently 

be^seen as divided into two opposing camps: pro-fidelity and 
pro-adaptation. The former conceptualizes innovations as consisting 
of a number of^relatively well specified components, and argue that 
riqorously developed, evaluated programs should be implemented with 
close correspondence to validated models. The latter argues that 
differing organizational contexts and practitioner needs demand 

on-site modification. To provide empirical evidence about the^ 

pro-fidelity position,, seven innovative social programs developed and 
disseminated nationwide by various public sector organizations 
(schools, city agencies, prisons, courts) using federal funds were 
studied. Methods for measuring program fidelity and effectiveness 
across sites were developed. Results supported J^e two pro-f idelity 
assumptions. Four of the seven programs exceeded the acceptable level 
of 'fidelity, while thfe remaining three were in the acceptable range. 
The second assumption was supported by a significant correlation 
between fidelity and effectiveness. However, despite the overall 
support, there was considerable cross-program variation. 
(Author/JAC) 
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J^idelity and Reinvention in the 
Implementation of Innovations 

The. field of social innovation policy research can presently be seen 
as divided into two opposing camps: "pro-fidelity" vs. "pro-adaptation" 
researchers (Fullan & Pomfret, 1977). The f,ormer conceptualize innovations- 
ajs consisting, of a number of relatively well specified components, and argue 
that rjgorou^Jy developed and evaluated programs should be implemented with 
/close correspondence to the validated models or else' suffer the consequences 
of "dilution" (Boruch & Gomez,' 1977; Calsyn, TornetzKy, & Dittmar, 1977). 
Dilution is expected to lead in most cases to reductions in outcome effective- 
ness. A more moderate fidelity position was taken by Hall and Loucks (1978) 
who argued that adaptation is acceptable up to the "zone of drastic muta- 
tion," beyond which the innovation \oses its integrity. Therefore, infor- 
mation concerning the parameters of the drastic mutation zone for any inno- 
vation is of crucial' importance to policy makers, disseminators, and users. 

On the other hand, "pro-adaptation" researchers and practitioners argue 
that differing organizational, contexts and practitioner needs demand on- 
site modification, virtually without exception (Berman & McLaughlin, 1978; 
House, et al. ," 1972). For example, according to,6ephart, 

A specific product or procedure is developed for a particu- 
lar purpose or function (but). .. .typically, pui'-poses or 

functions differ from setting to setting. ... (and) although 

the ideal system' would be one which had the needed nun.ber and 
types of components universally required. .. .we seldom know 
enough in a design effort to create all the component parts 
(1976, ^p. 5-6). 

One implication of the pro-adaptation perspective is that the freer 
users are to adapt programs to their local needs, the more likely that the 
program which is adopted will last. A second implica.tion is that the more 
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the program is nwdified to suit the site, the more likely it is to achieve 
the outcomes desired by users. A third, even more radical implication of 
this perspective, is that instead of channeling initial program development 
funds to specific developer sites, funding should instead be devoted tp 
building the capacities of the local sites to develop innovations independently. 

Although, the pro-adaptation position has attracted an increasing number 
of adherents in recent years (Datta, 1981), the research foundations of the 
principal supporting studies are somewhat tenuous. For example, the widely- 
cited RAND, report on Federal programs supporting educational change (Benrwrn 
&' McLaughlin, 1978) found three dominant patterns for implementation: co- 
optation -(when "the staff adapted the project. . . without any corresponding 
changes in traditional institutional behavior or practices"), mutual adapta- 
tion (when ^th project and setting were changed) and nonimplementation. 
The RAND researchers reported that "mutual adaptation was the only process 
leading to teacher change," and "had a better chance of being effective'ly 
implemented" than coopted projects. In addition, they reported a striking 
■absence of high fidelity adoption. 

A closer look at the Rand methodology reveals the absence of any bona 
fide measure of program fidelity. .The RAND researchers used as their 
implementation outcome measure "the extent to which projects met their own 
goals, different as they might be for each project" (Berman & McLaughlin, 
1977, Vol. VII, p. 50). Therefpre, their implementation measure was biased 
to reflect adaptation, rather than fidelity', there was no conclusive way 
to determine the* extent that these programs were modified or what components 
were changed. Additional doubts concerning the 'W\ND conclusions were 

raised by Datta (1981), who noted, th^it the "programs"" examined were for 

i 

the most part loosely-defined policy statements, rather than highly speci- 
fied social programs. 



In fact, in a recent article Berman (1980) has advanced the fidelity- 
adaptation debate considerably by proposing a normative contingency model 
^or implementation strategy. This contingency model implies that different 
strategies for implementation are most appropriate for different situations 
(i.e., broad policy statements vs. explicit programs). According to Berman 

There is no universally best way to implement policy. Either 
programmed (pro-fidelity) or adaptive implementation can be 
effective if applied to the appropriate policy situation. 
....Folicy situations are often so complex that a mix of 
' programmed and adaptive strategies might be more effective 
than a simple choice between the two. 

Berman suggests five, situe.tional parameters to be considered when 
designing an implementation strategy: (1). scope '-of change (incremental 
or major); (2)' ceKainty- of technology or theory; (3) amount of conflict 
over policy goals and means; (4) structure of the institutional setting 
(tightly vs. loosely coupled); and (5) the environment's stability. He 
argues that relatively structured conditions support the use of programmed 
(fidelity) approaches, while unstructured situations imply the use^of 
adaptive strategies. ' 

However, despite the clear good sense of Berman *s' present position, 
few decision-makers ornrewarchers seem to have adopted it. Instead, as 
Serman notes, "advocates on both sides^ seem to be thr/owing down the 
gauntlet," (p. 206) and a policy shift on the federal level -towards a pro- 
adaptation position has already^begun (Datta, 1981). 

The present research was designed in part to provide empirical evi- 
dence concerning the viability of the pro-fidelity position. In order to 
do so, two basic assumptions of this position were examined: 

1) Programs which have been operational ized in rela,tively unambiguous 
and concrete terms can actually be implemented with acceptable fidelity at 
adopting sites; and 
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2) Programs which have been demonstrated to be effective in devel- 
oper demonstration- projects will also produce positive outcomes at adopt- 
ing sites if implemented with acceptable fidelity. - ; 

Methodology and results bearing on these assumptions will be dis- 
cussed- in this paper. Before turning to that discussion, the .concept of [ 
reinvention will be briefly examined. Future reports will di?cuss the 
empirical relationship of reinvention with fidelity and effectiveness. 

Rainvsntion ^ ^ 

The term "reinvention" was introduced by Rogers and his colleagues 
(Eveland, et al.i 1977; Rice & Rogers, 1979) to capture the flavor of an| 
active process of change at user sites. "Reinvention" brings to mind the 
phrase "Not Invented Here," a conmon slogan used in both public and private 
sector organizations to describe the rejection of outsiders' ideas simply 
because they originated outside the organization. Such ideas must be 
"reinvented"' to counter the "Not Invented Here" syndrome. ^ 

However,, despite the potential usefulnes-S of the term "reinvention," 
the research by Rogers and his associates may not be general izable to 
modified RD&D innovations, since the programs examined by Rogers and his 
colleagues were disseminated with low component specificity and explicit- 
ness. Such programs may behave quite differently from programs which are 
more well-in-hand (Gephart, 1976). It is therefore fruitful to consider 
what the concept of reinvention may add to the conceptualization of RD&D 
innovations. Perhaps the concept of fidelity alone more parsimoniously 
accounts for the salient phenomena (Taylor, 1980); and reinvention is 
simply an unnecessary synonym for low-fidelity implementation. 



In disagreenent with Taylor, and following Larsen and Ag'arwala- 
Rogers (1977), we would argue that at least two alternative definitions 
for reinvention are possible, each of* which could distinguish "reinvention" 

. (both conceptually, and operationally) from lack-of-fidelity. For example, 
it is useful to contrast program changes which are relatively creative 
(adding something new to the program) vyith relatively uncreative changes. 
The term "modification" connotes the latter, while, the sense of activity 

"and creativity implied by "reinvention" suggests it may best be used to 
refer to the fomner type of change. 

Note that this conceptualization of reinvention assumes tf^at the pro- 
gram under studiy was disseminated as a set of relatively concrete components 
and that the component set was sufficiently complete to withstand the rigors 
of transfer without requiring major additions at the adopting site. If 
these assumptions are not met; the innovation virtually requires "creative 
adaptation" at^Ke adopting site, and the usefulness of a concept which 
distinguishes creative from uncreative adaptation is thus questionable. 

- A second possible definition, for reinvention could limit use of the 
tertn to instances where changes in the program were made in a deliberate 
or obvious effort to defend the innovation against the "Not Invented Here" 
syndrome, by giving the program a character unique to the adopting 
organization. 

These- two conceptualizations differ from that of Eveland, et al. 
(1977); Rice & Rogers (1979); and Larsen & Agarwala-Rogers (1977) who use 
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the term reinvention as a catch-all for "the ways in which an innovation 
is changed during the process of implementation. .. (including such notions 
as) Fidel1ty...Modification. .. Expansion... Extent. . .Flexibility. . . 
Complexity... (and) specificity" (Larsen & Agarwala-Rogers, 1977, pp. 
136-137). These two potential definitions also differ from the usage of 



Rorbaugh and Quinn (1980) who defined reinvention in terms of changing the 
purpose of an 'innovation. Noted also that both definitions could describe 
changes which actually enhance the fidelity of the adopted model. Eadh of 
these usages is thus distinct conceptually from "lack-of -fidelity." 

Rather than attempting to define the concept a priori, the strategy 
used in the present study was to collect case study.notes on every variation 
that differed in any way from the variations listed in the fidelity instru- 
ment. These qualitative data will be content-analyzed to determine the 
most comprehensive and meaningful definition of reinvention. Content 
analysis will also be used to categorize instances of reinvention and 
determine 'the frequency of occurrence of different types of reinvention. 

Method 

Recall that the empirical ground covered by this paper is limited to 
the examination of the two assumptions stated above, which may be summar- 
ized as: (1) Programs-can be implemented with acceptable fidelity at adopt- 
ing sites; and (2) The greater the fidelity, the greater the effectiveness 
of the implemented program. ^ 

These assunptions were examined within the context of the three-phase 
project described in the first two symposium papers. Briefly reviewing this 
context, eight programs developed and disseminated using the "modified RD&D 
approach" were selected. Four of these programs were educational innova- 
tions, selected from the. catalogue of the National Diffusion Network, 
Department of Education (Emrick, et al.^, 1977), and four were criminal 
justice innovations disseminated by the Exemplary Projects Program of the 
Justice Department's National Institute of Justice. (The National Institute 
' Host Program, 1979). In order to select a subset of the many NDN and NIJ 
programs for study, the following two criteria were used: (1) potential 
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for at least 20 site adoptions per program (-to provide sufficient stat- 
istical power to detect significant relationships); and (2) "organization- 
wide" quality of each program. This latter criterion was required since 
the research issues concern organizational rather than individual innova- 
tion implementation. Materials, for each innovation disseminated by the 
NDN and the Exemplary Projects Program were independently rated on the 
selection criteria. Ratings were then discussed, by the entire research 
group, resulting in the. program, selections. Following the first two 
phases of the project, one of the four educational programs was dropped 
from the study, since the data collected in the first two phases had 
revealed that the program did not adequately meet the original selection 
criteria. Brief descriptions of the seven programs investigated in the 
third phase are included in Figure 1. 
Sample 

Reviewing the sampling ^rategy de§jcribed in the previous symposium 
papers, listsV organizations which had contact with program developers 
were obtained. Adopters were randomly sampled from these lists. The 
unit pf analysis for Phases Two and Three was the organization in which 
the. program was housed, in some cases, this differed from the organization 
which made the adoption decision, since implementation entailed creating 
a new organization or subcontracting to another agency. For example, in 
one case a crime prevent.ion program was adopted by a police department and 
later moved to the town's Bureau of Neighborhood Associations. In several 
other cases, alternative schools were created by district school boards 
to administer Experienced Based Career Education or Focus programs. 

A subsampte of the Ph^se Two respondent organizations were selected 
for inclusion in Phase Three. Ten organizations froft each of the seven 
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Innovations were chosen to be ^ite visited. This resulted in a Phase 
Three sample size of 70 organizations.. Three criteria influenced this 
selection process. The most important criterion requii*ed including or- , 
yduUations that exhibited a range, of fidelity scores which were calcu- 
lated from Phase Two data. ,Thus, for each innovation, three~ organizations 
were selected from above and below one standard deviation from the mean 
and four were selected from the tnid'-range. This resulted in ten sites 
-that varied from high to low on fidelity. The second criterion was the 
length of a program's existence in the organization. If possible, pro- 
grams that had^been in existence for at least two years were selected in 
order to conform to the requirements of studying routinization. Finally, 
a broad geographic distribution was sought. In sum, 70 organizations 
that exhibited variabtMty on fidelity/ had been ifi place for two years, 
and were located across the entire continental U.S. were selected for 
site visits. 

Measuring Program Fidel itVi^ 

The five step model for developing a fidelity instrument proposed by 
Hall and Loucks C1978) was utilized, with, several modification to suit 
the scope and purpose of the study. 

Preliminary identification of innovation components . The purpose of 
the present stucjy involved examining the viability of the pro-fidelity 
position. Therefore, rather than attaining a comprehensive description 
of the innovation in practice by interviewing users as well as developers, 
it was decided to limit the sources, for component identification to those 
individuals, who were involved with the program before it had an oppor- 
tunity to be modified or reinvented at adopting sites. Consequently, the 
sample of respondents for component identification was limited to several 
actors* at the'developer site. 
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Each developer site was visited by two n'.embers of the project staff. 
Interviews with several staff members at each site were tape- recorded. 
All written materials and tapes for each innovation were independently 
content-analyzed by two staff members in order to identify innovation 
components. The components .were selected/to conform to the following 
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criteria: (1) The component should be an observable activity, -material 
or facility". If not observable, the implementation of the ^component 
should be verifiable through interviews with staff members and clients 
of the implementing organization. (2) Each component should be logi- 
cally discrete fropi other components, and wherever possible, should not 
depend on the implementation of other components. (3) Each component 
should be relatively "innovation-specific"; practices whicl:^ are common 
to o^hen programs in the organization^ should not be considered components. 
*'(4) The list of components should e;<haustively describe the innovation. 

\ Preliminary identification and scaling of variations . The method- 
ology/pioneered by Hall and his associates for measuring implementation 
requires the identification of "variations'* for each of the innovation's 
components. These variations^ are scaled as "ideal," "acceptable," or 
"unacceptable." Thus> fidelity is not measured simply by the'-number of. ^ 
components, implemented at the user site, but can be represented by a 
"fidelity scor,e" which reflects th,e extent of component variation af 
the site. 

Due to the limited numb^ of adopters of each innovation, researchers 
who had visited the original innovation sites generated variations 
(rather than obtaining variations through extensive interviews of adopters). 
Subsequent modifications of variations were based on ih-depth conversations 
with the innovation developers and pilot interviews with adapters. In 
generating variations , the researchers attempted to list discrete, ob- 
servable, and^ quantifiable alternatives. 
» 
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Feedback interviews with developers . In order to check the accuracy 
of^he preliminary identification of coniponer>ts and variations, staff 
ipenbers of developer organizations who had been interviewed previously 
were re-contacted. This second contact involved mailing a list of compp-- 
-.nents and variations to each staff member. The component variations 
(i.e., e^ch component-specific set of ideal-acceptable-unacceptable , 
variations) generated by the research team were reviewed by developers 
with the following questions in mindK "Are these variations realistic? 
Do they describe the possible implementation of the model program com- 
pletely, or are there other important variations which should be included? 

y 

Are the researchers correct in their labeling of variations as ideal, 

acceptable, or' unacceptable?" 

Feedback from developers concerning the preliminary identification 
\ • ' 
of components and variations was thus obtained, and appropriate modi-fii- 

cations and additions were made to the liits. The result was a list of 
components and scaled variations for each innovation. * . ' 

*^ Data collection . * The procedure for collecting actual, data on imple- 
mentation involved two pairs of researchers traveling to the sites^selected 
for the on-site sample. Each peri:r 'visitc-d 35, sites, and spent two day.s at' 
.each site. Data collection consisted of interviews with respondents from 
several role positions at each site, pbservations of pertinent activities 
. and facilities (e.g., blockrwatch meetings, arbitration hearings, juror 
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orientations, interactions among teachers, aides, and students, etc.), 
and examinations of archival records. In addition, information concerning 
progrfim effectiveness was collected during site visits as well. Immediately 
following each, site visit, the researchers discussed their notes and arrived 
at a consensus. decision on the fidelity of each component (ideal, acceptable 
or unacceptable). 
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Reliability and validity . Inter-rater reliability was checked at 
13 of the 70 site visits (18%), with each pair of researchers conducting 
one reliability check for each program. (One reliability check was 
missed due to logisticarproblems. ) During the checks, both researchers 
interviewed the same respondent's, observed the paLe activities, and examined 
the same documents. Forms were coded independently, and results' were coin- 
pared. Usdng, the percentage of exact agreement method, an overall relia- 
bility of .81 was achieved. At ^the sites which were not included in the 
vreliability sample, the researchers interviewed, observed, and examined, 
different data. At these sites they also coded the data independently, 
but discussed their reasons for coding before arriving at a consensus on 
final coding decisions.. At sites which served as reliability checks, .this 
.consensus procedure occurred following the determination of the site'^ 
reliability. 

Foliowijrig data collection, a rational -empirical scaling method 
(Jackson, 1970) was used to determine the best sub-scale structure for 
the fidelity instrument. These scales are discussed in the Results 
section, below. 

The most serious valijdjity issue concerned agreement among different 
sources of data (.e.g., resDondents from different organizational roles, 
observations of behavior and examination of archival^records). For most 
program components, several data sources were used, by the researchei^^^ in 
determining their rating of a particular componeiik|s fidelity score. Con- 
sequently, the extent of agreement between each source and the' researcher's 
consensus decision for each component was feli to reflect the extent to 
which the researchers were truly capturing the program fidelity. Rather 
than examining only a sample of these agreements, the percentage agree- 
roent between the researchers consensus ratings and the various data sources 
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for each component were computed for all components on which multiple 
sources of data wer^ available (7066 out of 9214 total data sources; or 
77X, were multiple Sources). The overall percentage agreement between 
these data sources and respondent's ratings was .96. 
Measurement of Program Effectiveness 

.Recall that each of the seven social programs had been evaluated prior 
to dissemination by the NDN and Exemplary Project Program. The same out- 
come witeria that were used in- these evaluations were also employed in 
the present study. These criterion measures are listed in Figure 2. 

There was' a great deal of variation among sites regarding the quality 
and format of these measures (e.g., some were aggregated in yearly reports, 
while others were available only in weekly tabulations). There was als9 
variation in the lime periods for which data was available; many sites 
had data for a time period contiguous to or immediately preceding the site 
visit, but some sites only had year-old data available. In addition, some 
sites did not havje outcome data on all of the criterion indices or had no 
ev.ldence of program effectiveness available. Given these limitations on . 
^he quality of the outcome data, and given the requirement of obtaining 
effectiveness scores, which could be translated into a common metric for 
analyses across programs, a ranking procedure was used. This involved 
the two site- visit teams re-rpairing so that each new teapi had first-hand • 
experience with all siteL The teams then reviewed the available archival 
outcome data for-each program, and final decisions were made on the 
specific indices to.be considered as outcome criteridn measures for each 
program. These outcome criteria are listed in Figure 2. The teams then 
independently ranked sites within programs, resulting ih two sets of 
. rankings for 65 sites (outcome* data was unavailable for 5 sites). Each 
of the seven sets of program rankings were correlated to test the procedure 



reliability. An overall rank-order correlation of .90 (Spearman Rho, 
with Spearman-Brown formula used to correct for the two sets of raters) 
was obtained, indicating that the procedure was highly reliable. 
I Resul'ts 

The first set of data analyses concerned the distributions of fidelity 
scores among programs and social policy area%. Figure 2 shows the fre- 
,quency distributions of the raw fidelity average-item scores. Note that 
four of the seyen programs clearly scored in the acceptable range, with 
a mean fidelity average-item score across sites greater than one (0 = 
unacceptable,. 1 = acceptable, .^ = ideal). Of the remaining three programs, 
means of .944, .861, and .860 indicate scores close to the acceptable 
level. It should be noted that of the four scores which exceeded the ac- 
ceptable value, two were from the educational policy area and two were from 
the criminal justice policy area. 

An analysis of variance was performed to test for differences between 
program means. Table 1 shows' that the differences among 'program fidelity 
means were signi-ticant "(F=11.45(6,63), p<.001, cj2=.4723). Table 1 also 
shows the results of a Scheffe post-hoc procedure, indicating the between 
program differences responsible for the overall significant F value. These 
results must be qualified by the fact that the use of analysis of variance 
in this situation rests on the following assumptions: (1) The fidelity 
metric is an interval -level scale, and (2) that the fidelity score for one 
program is measured 6n the same scale' as that of another program. Since 
the seven programs are implemented in different organizational contexts 
(e.g., elanentary schools, courts, prisons, city agencies, etc.), and 
since the ftd|JI1ty measure 1s to some extent program-specific (e.g., the 
number of components per program ranges from 36 to 103), one might argue 

4, 

that seven different measures are actually employed. However, this can be 



viewcsd as a conservative position; a more liberal view would' hold that 
since identical procedures were used for the development of each program's 
component set, the fidelity measures are fairly comparable across 
programs. 

Standardization of scores is the logical solution to this problem. 
However, since this process equates means and variances, it defeats the 
purpose of an analysis of variance. Standardization can, however, be 
meaningfully used in the types ^of correlational analyses required to 

examine the second pro-fidelit| assumption, and was thus employed to 
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increase the meaningfuliiess of cross-program comparisons,. / 
Recall that the second assumption" concerned the relationship between 
program fidelity and^ogram effectiveness. This assumption was examined 
both across^program and wi thin-program. The across-prog ram analysis 
involved examining the correlation between standardized fidelity scores 
and normalized program outcome ranks. The Pearson correlation produced 
by this analysis was r=.3797, n=65, p=.001, indicating a fairly strong 
ana significant relationship between fidelity and effectiveness" across 
programs. In order to obtain a better estimate of the true relationship 
between fidelity and effectiveness, this correlation was corrected for 
attenuation by using the respective reliability estimates: This resulted 
in a corrected correlation of .4447. The within-prog ram correlations 
-between overall fidelity and effectiveness are listed in Table 2. 

In order to further analyze the relationship between fidelity and 
outcome effectiveness, a rational -empirical scaling procedure was employed 
to determine the sub-scale structures of each program's component set. 
The criteria used in constructing sub-scales included maximizing rational 
interpretability and internal consistency. of sub-scales, and minimizing 
the correlations between sub-scales and the extent to which items from one 
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sub-sc«l\s correlated with other sub-scales. The resulting sub-scales ' 
were then correlated with the normalized outcome rankings within each 
program/ Significant sub-scale— effectiveness relationships are reported 

4 

in Table 3. * 

. - _ Discussion 

The results of the present research provide support for two pro- 
fidelity assumptions: (1) Programs which have been operationalized in 
relatively unambiguous, complete, and concrete terms can be implemented 
with acceptable fidelity at adopting sites; and (2) Explicitly opera- 
tionalized programs will be effective at adopting sites if they are 
implemented with reasonable fidelity. Regarding the first assumption, 
four of the seven programs exceeded the acceptable level of fidelity, 
while the remaining three fell within the acceptable range. The second 
assumption was supported by the significant correlation of .3797 (65),. 
p»*. 001 (.444.7 when corrected for attenuation) between fidelity and effect- 
iveness. . . 

These results contrast with much of the implementation literature 
which conveys, the impression that implementation with fidelity is generally 
impossible, dangerous, or foolhardy (e.g*. , Farrar, DeSahctis, & Cohen, 
1979). However, despite the overall support g?ven the assumptions, there 
is considerable acrpss-program variation with regard to both assumptions. 
A plance at Figyre 3 shows a range between .860 and 1.383 for fidelity 
average-item scores. Table 2 reveals, Pearson correlations between, overall 
fidelity scores and effectiveness ranging from .11 to .78. 

It is beyohd the scope of this paper to explain this variation. How-, 
ever, it should be noted that in addition to what is reported in this 
symposium, data on. several potential predictors of fidelity was collected. 
These predictors Included the extent of contact with program developers, 
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the extent of institutional support, staff rewards inherent in the program, 
and compatibility of the propram with the adopting organization. And, it 
should be recalled that this study also collected data on program modifi- 
cations which did not neatly fit into the fidelity measure. These modifi- 
cations will be examined to develop an empirically-based definition of the. 
concept of "reinvention." Finally, recall the situational parameters 
suggested by Berman (1980) as factors affecting the appropriateness of 
using a fidelity and/or an adapts tf on-oriented implementation strategy. 
These parameters cou>d certainly affect fidelity scores. Hopefully, future 
empirical studies will uncover the relative importance of these parameters. 

In sum, variation in fidelity and effectiveness scores might be ex- 
plained by the variable relationships diagrammed in Figure 4. Note that 
four'of Beman's five ?^tuational parameters are included as exogenous 
variables. (The fifth parameter, conflict over goals and means, overlaps 
with the predictor Institutional Support.) Note also that fidelity is 
conceptualized* as an. intervening variable which moderates the effectiveness 
of the program (Tdrnatzky, 1981),^ It is suggested that future research in 
this, area considers "the general approach indicated by this model, and the 
contingency approach suggested by Berman (1980), as fruitful guides for 
designing studieij of program implementation. 
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\ Figure 1 ■ . 

Innovative jSocial Programs Selected for Study 

Education 

1 Haul's ^HpId One Student to Succeed)— A diagnostic, prescriptive, tutorial 

prSgr^ tor children in grid es zj. Tutor, are c™it^^^^^^^ 
and cross^age students. The program Includes "pulling out" students from 
their regular classes at least % hour per day. 

2 FRrF#gxDerience Based Career Education )- This program provides career experi- 

• enceS?Se Sf school at volunteer field sites for the ^i^dent. ^ ach career 
site is systematically analyzed for its educational potenti a . Jtudenjs 
career and academic abilities and interests are systematically assessed. 
Individualized learning plans which integrate career experiences and academic 
learning are utilized. Programs typically take students from grades ll-l^, 
although some also accept students from 9-10. 

3. FO CUS (Focus bissemination Project)- A "school within a school'; disaffected 
junior and senior high school studen ts.. All students are required to partici- 
pate in a support/problem solving group of 8-10 students and one teacher, 
elhlv Sral cbStracting and a governing board with student representatives are 
important features.. Classes in the Focus program involve individualized, self- 
paced instruction. 

Criminal Justice - , 

4. nnOT fOnePav/One Trial )r -A jury management system that calls in a certain number 
of Dotential jurors per d ay. Potential jurors come in for that day and if pot 
sIlK to se^Jf in a trial -have completed their obligation.. Jurors who are 
selected serve the length of the trial. 

5. CAP (Cotnnunitv Arbitration PrQjectV nluvenile^offenders are sent to a formal 
arbitration hearing run by the court intake division, rather than to courts. 
S^veniles havf Jh^specific consequences of their actions explained to them 
with oarents Ind victims frequently present at hearings. Youths are then • 
?y5icSlirgivln a nSS of hours of 'informal supervision usually involving 

. work in thi community. Restitution is also frequently required. . . 

6 SCCPP (Seattle rnrnn iinity Crime Prevention Proqram)- This program is a three 
Dtose^tSck at residential burglary. It involves the setting up of a neighbor- 

watch ?hrS5Sh proactive targeting of neighborhoods, property marking . 
9. n?»' inventory, and::home security inspections. 

7 lirDor ^Mnntnomerv Countv Pre-Release C enter )— Involves the setting up of a 

^' fesfde&uliK^^^^ This facility should e in the 

Snity from which most of the inmates are drawn. Inmates are. encouraged to 
SSTso thav!hSy will have a job when they are released. Counseling, social 

• awareness instruction, and behavioral contracting are also part of this pro- 
gram. 
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